Dataset statistics
| Number of variables | 25 |
|---|---|
| Number of observations | 300000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 31.2 MiB |
| Average record size in memory | 109.0 B |
Variable types
| Numeric | 11 |
|---|---|
| Categorical | 14 |
Alerts
| Alert | Type |
|---|---|
| zip is highly correlated with lat and 3 other fields | High correlation |
| lat is highly correlated with zip and 3 other fields | High correlation |
| long is highly correlated with zip and 3 other fields | High correlation |
| merch_lat is highly correlated with zip and 3 other fields | High correlation |
| merch_long is highly correlated with zip and 3 other fields | High correlation |
| hour is highly correlated with category_gas_transport and 1 other field | High correlation |
| category_gas_transport is highly correlated with hour | High correlation |
| category_grocery_pos is highly correlated with hour | High correlation |
| amt is highly skewed (γ1 = 56.19402201) | Skewed |
| hour has 9746 (3.2%) zeros | Zeros |
| day has 59974 (20.0%) zeros | Zeros |
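The high-correlation alerts pair fields that move together, such as the cardholder and merchant coordinates. The sketch below illustrates the idea with Pearson's coefficient on invented toy values. The choice of Pearson, the `THRESHOLD` of 0.5, and the sample numbers are all assumptions for illustration; this section does not state which correlation measure or cut-off the report used.

```python
from math import sqrt

def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)

# Invented toy data: a cardholder latitude and a merchant latitude close to it.
lat = [26.1, 33.5, 38.4, 42.5, 46.2]
merch_lat = [26.9, 33.1, 38.9, 42.0, 46.8]

THRESHOLD = 0.5  # hypothetical cut-off, not taken from the report
r = pearson(lat, merch_lat)
flagged = abs(r) > THRESHOLD  # True here: the two columns are nearly collinear
```

Fields flagged this way carry largely redundant information, which is worth keeping in mind before feeding all of them to a model.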
Reproduction
| Analysis started | 2022-11-09 00:00:06.786425 |
|---|---|
| Analysis finished | 2022-11-09 00:02:22.719329 |
| Duration | 2 minutes and 15.93 seconds |
| Software version | pandas-profiling v3.3.0 |
amt
Real number (ℝ≥0)
| Distinct | 30525 |
|---|---|
| Distinct (%) | 10.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 70.12587797 |
| Minimum | 1 |
| Maximum | 28948.9 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.44 |
| Q1 | 9.66 |
| median | 47.44 |
| Q3 | 83.14 |
| 95-th percentile | 196.311 |
| Maximum | 28948.9 |
| Range | 28947.9 |
| Interquartile range (IQR) | 73.48 |
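The Range and IQR rows follow directly from the quantiles in the table above. A minimal sketch of that arithmetic, using the reported values; the 1.5 × IQR upper fence is a common rule of thumb added here for illustration, not something the report computes.

```python
# Quantile values copied from the table above.
minimum, q1, q3, maximum = 1.0, 9.66, 83.14, 28948.9

value_range = maximum - minimum  # spread of the full data
iqr = q3 - q1                    # spread of the middle 50% of values

# Tukey-style upper fence (illustrative assumption, not part of the report).
upper_fence = q3 + 1.5 * iqr
```

The fence lands at about 193.36, just under the reported 95th percentile (196.311), so at least 5% of the values lie beyond it.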
Descriptive statistics
| Standard deviation | 162.4457824 |
|---|---|
| Coefficient of variation (CV) | 2.316488393 |
| Kurtosis | 7709.998671 |
| Mean | 70.12587797 |
| Median Absolute Deviation (MAD) | 37.45 |
| Skewness | 56.19402201 |
| Sum | 21037763.39 |
| Variance | 26388.63222 |
| Monotonicity | Not monotonic |
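Several rows in the descriptive statistics are functions of one another, which makes a quick consistency check possible. The sketch below recomputes the coefficient of variation, the variance, and the sum from the reported mean, standard deviation, and the 300000 observations; it assumes CV is defined as standard deviation divided by mean, which the reported figures are consistent with.

```python
n = 300_000          # number of observations (from the dataset statistics)
mean = 70.12587797   # reported mean
std = 162.4457824    # reported standard deviation

cv = std / mean      # coefficient of variation, about 2.3165
variance = std ** 2  # about 26388.63
total = mean * n     # sum of all values, about 21037763.39
```

A CV above 2 together with a mean (70.13) well above the median (47.44) is what a long right tail looks like in summary form.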
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 1.14 | 142 | < 0.1% |
| 1.12 | 134 | < 0.1% |
| 1.2 | 133 | < 0.1% |
| 1.02 | 133 | < 0.1% |
| 1.22 | 130 | < 0.1% |
| 1.7 | 128 | < 0.1% |
| 1.09 | 127 | < 0.1% |
| 1.16 | 127 | < 0.1% |
| 1.1 | 125 | < 0.1% |
| 3.71 | 124 | < 0.1% |
| Other values (30515) | 298697 | 99.6% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 49 | < 0.1% |
| 1.01 | 112 | < 0.1% |
| 1.02 | 133 | < 0.1% |
| 1.03 | 116 | < 0.1% |
| 1.04 | 116 | < 0.1% |
| 1.05 | 117 | < 0.1% |
| 1.06 | 97 | < 0.1% |
| 1.07 | 109 | < 0.1% |
| 1.08 | 115 | < 0.1% |
| 1.09 | 127 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 28948.9 | 1 | < 0.1% |
| 27119.77 | 1 | < 0.1% |
| 22768.11 | 1 | < 0.1% |
| 13149.15 | 1 | < 0.1% |
| 11422.83 | 1 | < 0.1% |
| 11371.95 | 1 | < 0.1% |
| 9197.47 | 1 | < 0.1% |
| 8517.38 | 1 | < 0.1% |
| 7886.26 | 1 | < 0.1% |
| 7288.5 | 1 | < 0.1% |
zip
Real number (ℝ≥0)
| Distinct | 963 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48893.8212 |
| Minimum | 1257 |
| Maximum | 99921 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1257 |
|---|---|
| 5-th percentile | 7439 |
| Q1 | 26292 |
| median | 48193 |
| Q3 | 72042 |
| 95-th percentile | 94569 |
| Maximum | 99921 |
| Range | 98664 |
| Interquartile range (IQR) | 45750 |
Descriptive statistics
| Standard deviation | 26849.73684 |
|---|---|
| Coefficient of variation (CV) | 0.5491437604 |
| Kurtosis | -1.094036271 |
| Mean | 48893.8212 |
| Median Absolute Deviation (MAD) | 23039 |
| Skewness | 0.07646239989 |
| Sum | 1.466814636 × 10¹⁰ |
| Variance | 720908368.2 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 82514 | 839 | 0.3% |
| 73754 | 829 | 0.3% |
| 48088 | 817 | 0.3% |
| 34112 | 815 | 0.3% |
| 58569 | 771 | 0.3% |
| 21872 | 759 | 0.3% |
| 72042 | 753 | 0.3% |
| 49628 | 750 | 0.2% |
| 49895 | 739 | 0.2% |
| 38761 | 736 | 0.2% |
| Other values (953) | 292192 | 97.4% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1257 | 455 | 0.2% |
| 1330 | 251 | 0.1% |
| 1535 | 120 | < 0.1% |
| 1545 | 221 | 0.1% |
| 1612 | 113 | < 0.1% |
| 1843 | 629 | 0.2% |
| 1844 | 453 | 0.2% |
| 2180 | 133 | < 0.1% |
| 2630 | 452 | 0.2% |
| 2908 | 143 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 99921 | 1 | < 0.1% |
| 99783 | 364 | 0.1% |
| 99747 | 3 | < 0.1% |
| 99746 | 118 | < 0.1% |
| 99323 | 622 | 0.2% |
| 99160 | 714 | 0.2% |
| 99116 | 2 | < 0.1% |
| 99113 | 260 | 0.1% |
| 99033 | 583 | 0.2% |
| 98836 | 126 | < 0.1% |
lat
Real number (ℝ≥0)
| Distinct | 975 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.53826613 |
| Minimum | 20.0271 |
| Maximum | 66.6933 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 20.0271 |
|---|---|
| 5-th percentile | 29.8826 |
| Q1 | 34.6205 |
| median | 39.3543 |
| Q3 | 41.8948 |
| 95-th percentile | 45.8433 |
| Maximum | 66.6933 |
| Range | 46.6662 |
| Interquartile range (IQR) | 7.2743 |
Descriptive statistics
| Standard deviation | 5.076917293 |
|---|---|
| Coefficient of variation (CV) | 0.1317370448 |
| Kurtosis | 0.7906627346 |
| Mean | 38.53826613 |
| Median Absolute Deviation (MAD) | 3.3677 |
| Skewness | -0.1848757267 |
| Sum | 11561479.84 |
| Variance | 25.7750892 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 43.0048 | 839 | 0.3% |
| 36.385 | 829 | 0.3% |
| 42.5164 | 817 | 0.3% |
| 26.1184 | 815 | 0.3% |
| 46.1838 | 771 | 0.3% |
| 38.4121 | 759 | 0.3% |
| 34.2853 | 753 | 0.3% |
| 44.5995 | 750 | 0.2% |
| 46.3535 | 739 | 0.2% |
| 33.4783 | 736 | 0.2% |
| Other values (965) | 292192 | 97.4% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 20.0271 | 370 | 0.1% |
| 20.0827 | 208 | 0.1% |
| 24.6557 | 590 | 0.2% |
| 26.1184 | 815 | 0.3% |
| 26.3304 | 120 | < 0.1% |
| 26.3771 | 135 | < 0.1% |
| 26.4215 | 693 | 0.2% |
| 26.4722 | 602 | 0.2% |
| 26.529 | 332 | 0.1% |
| 26.6939 | 261 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 66.6933 | 3 | < 0.1% |
| 65.6899 | 118 | < 0.1% |
| 64.7556 | 364 | 0.1% |
| 55.4732 | 1 | < 0.1% |
| 48.8878 | 714 | 0.2% |
| 48.8856 | 493 | 0.2% |
| 48.8328 | 367 | 0.1% |
| 48.6669 | 235 | 0.1% |
| 48.6031 | 732 | 0.2% |
| 48.4786 | 471 | 0.2% |
long
Real number (ℝ)
| Distinct | 976 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.2547715 |
| Minimum | -165.6723 |
| Maximum | -67.9503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 300000 |
| Negative (%) | 100.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -165.6723 |
|---|---|
| 5-th percentile | -119.0825 |
| Q1 | -96.798 |
| median | -87.4769 |
| Q3 | -80.1752 |
| 95-th percentile | -73.5365 |
| Maximum | -67.9503 |
| Range | 97.722 |
| Interquartile range (IQR) | 16.6228 |
Descriptive statistics
| Standard deviation | 13.73774947 |
|---|---|
| Coefficient of variation (CV) | -0.1522107833 |
| Kurtosis | 1.838306623 |
| Mean | -90.2547715 |
| Median Absolute Deviation (MAD) | 8.1276 |
| Skewness | -1.145172422 |
| Sum | -27076431.45 |
| Variance | 188.7257604 |
| Monotonicity | Not monotonic |
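The negative coefficient of variation above is not an error: it is what results from dividing a positive standard deviation by a negative mean. The sketch below reproduces the reported figure under the assumption that CV is computed as standard deviation over mean, and shows a sign-free variant (`cv_magnitude`, a name introduced here for illustration).

```python
mean = -90.2547715  # reported mean (degrees, all values negative)
std = 13.73774947   # reported standard deviation

cv_signed = std / mean          # negative only because the mean is negative
cv_magnitude = std / abs(mean)  # relative spread without the sign
```

Because the zero point of a coordinate axis is arbitrary, either figure says little about the data; the standard deviation and IQR are the more meaningful spread measures here.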
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -108.8964 | 839 | 0.3% |
| -98.0727 | 829 | 0.3% |
| -82.9832 | 817 | 0.3% |
| -81.7361 | 815 | 0.3% |
| -101.2589 | 771 | 0.3% |
| -75.2811 | 759 | 0.3% |
| -82.7243 | 758 | 0.3% |
| -91.3336 | 753 | 0.3% |
| -86.2141 | 750 | 0.2% |
| -86.6345 | 739 | 0.2% |
| Other values (966) | 292170 | 97.4% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -165.6723 | 364 | 0.1% |
| -156.292 | 118 | < 0.1% |
| -155.488 | 208 | 0.1% |
| -155.3697 | 370 | 0.1% |
| -153.994 | 3 | < 0.1% |
| -133.1171 | 1 | < 0.1% |
| -124.4409 | 222 | 0.1% |
| -124.2174 | 372 | 0.1% |
| -124.1587 | 228 | 0.1% |
| -124.1437 | 336 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| -67.9503 | 474 | 0.2% |
| -68.5565 | 235 | 0.1% |
| -69.2675 | 107 | < 0.1% |
| -69.4828 | 501 | 0.2% |
| -69.9576 | 122 | < 0.1% |
| -69.9656 | 647 | 0.2% |
| -70.1031 | 4 | < 0.1% |
| -70.239 | 194 | 0.1% |
| -70.239 | 45 | < 0.1% |
| -70.3001 | 452 | 0.2% |
city_pop
Real number (ℝ≥0)
| Distinct | 873 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 88514.15292 |
| Minimum | 23 |
| Maximum | 2906700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 23 |
|---|---|
| 5-th percentile | 139 |
| Q1 | 741 |
| median | 2435 |
| Q3 | 20328 |
| 95-th percentile | 518429 |
| Maximum | 2906700 |
| Range | 2906677 |
| Interquartile range (IQR) | 19587 |
Descriptive statistics
| Standard deviation | 302042.5067 |
|---|---|
| Coefficient of variation (CV) | 3.412363975 |
| Kurtosis | 37.96721111 |
| Mean | 88514.15292 |
| Median Absolute Deviation (MAD) | 2180 |
| Skewness | 5.622829603 |
| Sum | 2.655424588 × 10¹⁰ |
| Variance | 9.122967585 × 10¹⁰ |
| Monotonicity | Not monotonic |
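The positive skewness and kurtosis above, together with a mean (88514) far above the median (2435), describe a distribution dominated by a few very large values. The sketch below shows how such moments behave on invented toy data. It uses the biased population formulas (m3 / m2^1.5 and m4 / m2^2 − 3), which is an assumption: the exact estimator behind the reported figures is not stated here, so the numbers are not meant to match the table.

```python
from statistics import mean, median

def central_moment(xs, k):
    """k-th central moment of xs (population form, divides by n)."""
    m = mean(xs)
    return sum((x - m) ** k for x in xs) / len(xs)

def skewness(xs):
    """Biased Fisher-Pearson skewness: m3 / m2**1.5."""
    return central_moment(xs, 3) / central_moment(xs, 2) ** 1.5

def excess_kurtosis(xs):
    """Biased excess kurtosis: m4 / m2**2 - 3 (0 for a normal distribution)."""
    return central_moment(xs, 4) / central_moment(xs, 2) ** 2 - 3

# Invented toy populations: five small towns and one very large city.
toy = [100, 200, 300, 400, 500, 1_000_000]
toy_skew = skewness(toy)                       # positive: long right tail
mean_exceeds_median = mean(toy) > median(toy)  # True for this sample
```

A symmetric sample such as `[1, 2, 3, 4, 5]` gives a skewness of 0 with the same functions, which makes the contrast with the heavy-tailed case easy to see.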
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 606 | 1291 | 0.4% |
| 1312922 | 1201 | 0.4% |
| 1595797 | 1165 | 0.4% |
| 1766 | 1026 | 0.3% |
| 241 | 1008 | 0.3% |
| 2906700 | 985 | 0.3% |
| 1126 | 966 | 0.3% |
| 302 | 956 | 0.3% |
| 198 | 953 | 0.3% |
| 276002 | 935 | 0.3% |
| Other values (863) | 289514 | 96.5% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 23 | 472 | 0.2% |
| 37 | 240 | 0.1% |
| 43 | 470 | 0.2% |
| 46 | 685 | 0.2% |
| 47 | 114 | < 0.1% |
| 49 | 232 | 0.1% |
| 51 | 242 | 0.1% |
| 52 | 123 | < 0.1% |
| 53 | 644 | 0.2% |
| 60 | 219 | 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 2906700 | 985 | 0.3% |
| 2504700 | 466 | 0.2% |
| 2383912 | 118 | < 0.1% |
| 1595797 | 1165 | 0.4% |
| 1577385 | 587 | 0.2% |
| 1526206 | 811 | 0.3% |
| 1382480 | 472 | 0.2% |
| 1312922 | 1201 | 0.4% |
| 1263321 | 847 | 0.3% |
| 1241364 | 587 | 0.2% |
merch_lat
Real number (ℝ≥0)
| Distinct | 297367 |
|---|---|
| Distinct (%) | 99.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.53994798 |
| Minimum | 19.027849 |
| Maximum | 67.397018 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 19.027849 |
|---|---|
| 5-th percentile | 29.74189105 |
| Q1 | 34.7378955 |
| median | 39.368308 |
| Q3 | 41.95946525 |
| 95-th percentile | 46.02228925 |
| Maximum | 67.397018 |
| Range | 48.369169 |
| Interquartile range (IQR) | 7.22156975 |
Descriptive statistics
| Standard deviation | 5.111003209 |
|---|---|
| Coefficient of variation (CV) | 0.1326157267 |
| Kurtosis | 0.7749178973 |
| Mean | 38.53994798 |
| Median Absolute Deviation (MAD) | 3.402611 |
| Skewness | -0.1825545615 |
| Sum | 11561984.4 |
| Variance | 26.12235381 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 39.24133 | 3 | < 0.1% |
| 38.556877 | 3 | < 0.1% |
| 38.986749 | 3 | < 0.1% |
| 38.716326 | 3 | < 0.1% |
| 40.137418 | 3 | < 0.1% |
| 41.930416 | 3 | < 0.1% |
| 38.933501 | 3 | < 0.1% |
| 43.327495 | 3 | < 0.1% |
| 43.479957 | 3 | < 0.1% |
| 42.890771 | 3 | < 0.1% |
| Other values (297357) | 299970 | > 99.9% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 19.027849 | 1 | < 0.1% |
| 19.0419 | 1 | < 0.1% |
| 19.044747 | 1 | < 0.1% |
| 19.045277 | 1 | < 0.1% |
| 19.048124 | 1 | < 0.1% |
| 19.060875 | 1 | < 0.1% |
| 19.062888 | 1 | < 0.1% |
| 19.072289 | 1 | < 0.1% |
| 19.07844 | 1 | < 0.1% |
| 19.079607 | 1 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 67.397018 | 1 | < 0.1% |
| 67.188111 | 1 | < 0.1% |
| 66.682905 | 1 | < 0.1% |
| 66.679297 | 1 | < 0.1% |
| 66.67154 | 1 | < 0.1% |
| 66.664673 | 1 | < 0.1% |
| 66.624674 | 1 | < 0.1% |
| 66.609525 | 1 | < 0.1% |
| 66.599806 | 1 | < 0.1% |
| 66.595782 | 1 | < 0.1% |
merch_long
Real number (ℝ)
| Distinct | 298924 |
|---|---|
| Distinct (%) | 99.6% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | -90.25348029 |
| Minimum | -166.670685 |
| Maximum | -66.950902 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 300000 |
| Negative (%) | 100.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -166.670685 |
|---|---|
| 5-th percentile | -119.2618549 |
| Q1 | -96.908114 |
| median | -87.4729205 |
| Q3 | -80.27720875 |
| 95-th percentile | -73.3890317 |
| Maximum | -66.950902 |
| Range | 99.719783 |
| Interquartile range (IQR) | 16.63090525 |
Descriptive statistics
| Standard deviation | 13.75125306 |
|---|---|
| Coefficient of variation (CV) | -0.1523625794 |
| Kurtosis | 1.830671472 |
| Mean | -90.25348029 |
| Median Absolute Deviation (MAD) | 8.223297 |
| Skewness | -1.141771337 |
| Sum | -27076044.09 |
| Variance | 189.0969607 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| -83.06618 | 3 | < 0.1% |
| -82.159848 | 3 | < 0.1% |
| -95.672843 | 2 | < 0.1% |
| -123.159512 | 2 | < 0.1% |
| -74.700957 | 2 | < 0.1% |
| -86.618196 | 2 | < 0.1% |
| -75.174779 | 2 | < 0.1% |
| -94.441692 | 2 | < 0.1% |
| -84.631577 | 2 | < 0.1% |
| -80.745822 | 2 | < 0.1% |
| Other values (298914) | 299978 | > 99.9% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| -166.670685 | 1 | < 0.1% |
| -166.655425 | 1 | < 0.1% |
| -166.654993 | 1 | < 0.1% |
| -166.649771 | 1 | < 0.1% |
| -166.648577 | 1 | < 0.1% |
| -166.639673 | 1 | < 0.1% |
| -166.633523 | 1 | < 0.1% |
| -166.632072 | 1 | < 0.1% |
| -166.629875 | 1 | < 0.1% |
| -166.627418 | 1 | < 0.1% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| -66.950902 | 1 | < 0.1% |
| -66.955996 | 1 | < 0.1% |
| -66.960745 | 1 | < 0.1% |
| -66.963918 | 1 | < 0.1% |
| -66.96411 | 1 | < 0.1% |
| -66.971614 | 1 | < 0.1% |
| -66.977475 | 1 | < 0.1% |
| -66.985361 | 1 | < 0.1% |
| -66.986039 | 1 | < 0.1% |
| -66.989254 | 1 | < 0.1% |
age
Real number (ℝ≥0)
| Distinct | 82 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 48.69269667 |
| Minimum | 17 |
| Maximum | 98 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 17 |
|---|---|
| 5-th percentile | 25 |
| Q1 | 35 |
| median | 47 |
| Q3 | 60 |
| 95-th percentile | 83 |
| Maximum | 98 |
| Range | 81 |
| Interquartile range (IQR) | 25 |
Descriptive statistics
| Standard deviation | 17.39218856 |
|---|---|
| Coefficient of variation (CV) | 0.3571826938 |
| Kurtosis | -0.1757988633 |
| Mean | 48.69269667 |
| Median Absolute Deviation (MAD) | 12 |
| Skewness | 0.6096193579 |
| Sum | 14607809 |
| Variance | 302.488223 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 50 | 10472 | 3.5% |
| 38 | 9177 | 3.1% |
| 37 | 8766 | 2.9% |
| 35 | 8636 | 2.9% |
| 48 | 7934 | 2.6% |
| 46 | 7503 | 2.5% |
| 49 | 7251 | 2.4% |
| 36 | 7234 | 2.4% |
| 32 | 7155 | 2.4% |
| 44 | 7152 | 2.4% |
| Other values (72) | 218720 | 72.9% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 17 | 500 | 0.2% |
| 18 | 1844 | 0.6% |
| 19 | 965 | 0.3% |
| 20 | 2 | < 0.1% |
| 21 | 1344 | 0.4% |
| 22 | 2262 | 0.8% |
| 23 | 4483 | 1.5% |
| 24 | 2964 | 1.0% |
| 25 | 6924 | 2.3% |
| 26 | 1430 | 0.5% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 98 | 136 | < 0.1% |
| 97 | 2 | < 0.1% |
| 96 | 1440 | 0.5% |
| 95 | 1071 | 0.4% |
| 94 | 920 | 0.3% |
| 93 | 1375 | 0.5% |
| 92 | 829 | 0.3% |
| 91 | 1080 | 0.4% |
| 90 | 483 | 0.2% |
| 89 | 702 | 0.2% |
hour
Real number (ℝ≥0)
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 12.81252667 |
| Minimum | 0 |
| Maximum | 23 |
| Zeros | 9746 |
| Zeros (%) | 3.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 7 |
| median | 14 |
| Q3 | 19 |
| 95-th percentile | 23 |
| Maximum | 23 |
| Range | 23 |
| Interquartile range (IQR) | 12 |
Descriptive statistics
| Standard deviation | 6.818080035 |
|---|---|
| Coefficient of variation (CV) | 0.5321417245 |
| Kurtosis | -1.079681598 |
| Mean | 12.81252667 |
| Median Absolute Deviation (MAD) | 5 |
| Skewness | -0.2843813962 |
| Sum | 3843758 |
| Variance | 46.48621537 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 23 | 15637 | 5.2% |
| 22 | 15318 | 5.1% |
| 18 | 15280 | 5.1% |
| 21 | 15272 | 5.1% |
| 15 | 15258 | 5.1% |
| 19 | 15240 | 5.1% |
| 16 | 15233 | 5.1% |
| 17 | 15140 | 5.0% |
| 20 | 15084 | 5.0% |
| 13 | 15046 | 5.0% |
| Other values (14) | 147492 | 49.2% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 9746 | 3.2% |
| 1 | 9966 | 3.3% |
| 2 | 9859 | 3.3% |
| 3 | 9935 | 3.3% |
| 4 | 9736 | 3.2% |
| 5 | 9635 | 3.2% |
| 6 | 9763 | 3.3% |
| 7 | 9777 | 3.3% |
| 8 | 9927 | 3.3% |
| 9 | 9802 | 3.3% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 23 | 15637 | 5.2% |
| 22 | 15318 | 5.1% |
| 21 | 15272 | 5.1% |
| 20 | 15084 | 5.0% |
| 19 | 15240 | 5.1% |
| 18 | 15280 | 5.1% |
| 17 | 15140 | 5.0% |
| 16 | 15233 | 5.1% |
| 15 | 15258 | 5.1% |
| 14 | 15007 | 5.0% |
day
Real number (ℝ≥0)
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 2.986706667 |
| Minimum | 0 |
| Maximum | 6 |
| Zeros | 59974 |
| Zeros (%) | 20.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 3 |
| Q3 | 5 |
| 95-th percentile | 6 |
| Maximum | 6 |
| Range | 6 |
| Interquartile range (IQR) | 4 |
Descriptive statistics
| Standard deviation | 2.202142771 |
|---|---|
| Coefficient of variation (CV) | 0.7373147139 |
| Kurtosis | -1.461086656 |
| Mean | 2.986706667 |
| Median Absolute Deviation (MAD) | 2 |
| Skewness | -0.008326242623 |
| Sum | 896012 |
| Variance | 4.849432785 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 59974 | 20.0% |
| 6 | 56329 | 18.8% |
| 5 | 43540 | 14.5% |
| 1 | 42411 | 14.1% |
| 4 | 34518 | 11.5% |
| 3 | 33399 | 11.1% |
| 2 | 29829 | 9.9% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 59974 | 20.0% |
| 1 | 42411 | 14.1% |
| 2 | 29829 | 9.9% |
| 3 | 33399 | 11.1% |
| 4 | 34518 | 11.5% |
| 5 | 43540 | 14.5% |
| 6 | 56329 | 18.8% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 6 | 56329 | 18.8% |
| 5 | 43540 | 14.5% |
| 4 | 34518 | 11.5% |
| 3 | 33399 | 11.1% |
| 2 | 29829 | 9.9% |
| 1 | 42411 | 14.1% |
| 0 | 59974 | 20.0% |
month
Real number (ℝ≥0)
| Distinct | 12 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.136313333 |
| Minimum | 1 |
| Maximum | 12 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 4 |
| median | 7 |
| Q3 | 10 |
| 95-th percentile | 12 |
| Maximum | 12 |
| Range | 11 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.541097825 |
|---|---|
| Coefficient of variation (CV) | 0.4962082885 |
| Kurtosis | -1.250295477 |
| Mean | 7.136313333 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.1178102538 |
| Sum | 2140894 |
| Variance | 12.53937381 |
| Monotonicity | Not monotonic |
Common values
| Value | Count | Frequency (%) |
|---|---|---|
| 12 | 48634 | 16.2% |
| 6 | 26631 | 8.9% |
| 5 | 25600 | 8.5% |
| 3 | 25036 | 8.3% |
| 11 | 24612 | 8.2% |
| 9 | 24493 | 8.2% |
| 10 | 23656 | 7.9% |
| 4 | 23626 | 7.9% |
| 8 | 22580 | 7.5% |
| 7 | 19918 | 6.6% |
| Other values (2) | 35214 | 11.7% |
Minimum values
| Value | Count | Frequency (%) |
|---|---|---|
| 1 | 18335 | 6.1% |
| 2 | 16879 | 5.6% |
| 3 | 25036 | 8.3% |
| 4 | 23626 | 7.9% |
| 5 | 25600 | 8.5% |
| 6 | 26631 | 8.9% |
| 7 | 19918 | 6.6% |
| 8 | 22580 | 7.5% |
| 9 | 24493 | 8.2% |
| 10 | 23656 | 7.9% |
Maximum values
| Value | Count | Frequency (%) |
|---|---|---|
| 12 | 48634 | 16.2% |
| 11 | 24612 | 8.2% |
| 10 | 23656 | 7.9% |
| 9 | 24493 | 8.2% |
| 8 | 22580 | 7.5% |
| 7 | 19918 | 6.6% |
| 6 | 26631 | 8.9% |
| 5 | 25600 | 8.5% |
| 4 | 23626 | 7.9% |
| 3 | 25036 | 8.3% |
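Each Frequency (%) entry in these tables is a count divided by the 300000 observations, shown to one decimal place, with very small shares displayed as "< 0.1%". The helper below is a sketch of that formatting. The name `format_share` is hypothetical, the lower cut-off is inferred from values visible in this report (a count of 142 shows as "< 0.1%", a count of 251 as "0.1%"), and the mirrored "> 99.9%" branch is an assumption added for symmetry.

```python
def format_share(count, total):
    """Format count/total as a one-decimal percentage string."""
    share = count / total
    # Shares that round to 0 at three decimals are shown as "< 0.1%".
    if share > 0 and round(share, 3) == 0:
        return "< 0.1%"
    # Assumed mirror case: shares just under 100% are shown as "> 99.9%".
    if share < 1 and round(share, 3) == 1:
        return "> 99.9%"
    return f"{share * 100:.1f}%"

TOTAL = 300_000  # number of observations in the dataset
december = format_share(48634, TOTAL)  # month 12, the most frequent value
rare = format_share(142, TOTAL)        # a value seen only 142 times
```

With this rule a count needs to reach 150 out of 300000 before it is shown as "0.1%" rather than "< 0.1%".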
is_fraud
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 298444 |
| 1 | 1556 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 298444 | 99.5% |
| 1 | 1556 | 0.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 298444 | 99.5% |
| 1 | 1556 | 0.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 298444 | 99.5% |
| 1 | 1556 | 0.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 298444 | 99.5% |
| 1 | 1556 | 0.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 298444 | 99.5% |
| 1 | 1556 | 0.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 298444 | 99.5% |
| 1 | 1556 | 0.5% |
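The is_fraud counts show a strongly imbalanced target: 1556 positive rows against 298444 negative ones. The sketch below quantifies that from the reported counts. The class-weight formula, n_samples / (n_classes × count), is one common heuristic and is an assumption here; the report itself does not prescribe any rebalancing.

```python
# Value counts for is_fraud, copied from the tables above.
counts = {0: 298444, 1: 1556}
total = sum(counts.values())

fraud_rate = counts[1] / total           # share of the positive class
imbalance_ratio = counts[0] / counts[1]  # negatives per positive, about 191.8

# "Balanced" class weights (a common heuristic, not taken from the report).
class_weights = {label: total / (len(counts) * c) for label, c in counts.items()}
```

At roughly 192 negatives per positive, plain accuracy is uninformative: a model that always predicts 0 would already score about 99.5%.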
category_food_dining
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 278845 |
| 1 | 21155 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278845 | 92.9% |
| 1 | 21155 | 7.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278845 | 92.9% |
| 1 | 21155 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278845 | 92.9% |
| 1 | 21155 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278845 | 92.9% |
| 1 | 21155 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278845 | 92.9% |
| 1 | 21155 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278845 | 92.9% |
| 1 | 21155 | 7.1% |
category_gas_transport
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 269561 |
| 1 | 30439 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 1 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 269561 | 89.9% |
| 1 | 30439 | 10.1% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 269561 | 89.9% |
| 1 | 30439 | 10.1% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 269561 | 89.9% |
| 1 | 30439 | 10.1% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 269561 | 89.9% |
| 1 | 30439 | 10.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 269561 | 89.9% |
| 1 | 30439 | 10.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 269561 | 89.9% |
| 1 | 30439 | 10.1% |
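The character-level tables treat each 0/1 indicator as a one-character string, which is why every value falls into the Unicode category "Decimal Number" and the ASCII block. The sketch below reproduces those two facts with Python's standard library; it is an illustration of the classification, not the report's own code, and the standard library has no script lookup, so the "Common" script is not checked.

```python
import unicodedata

values = ["0", "1"]    # the two distinct values of an indicator column
n_rows = 300_000       # number of observations

# "Nd" is the Unicode general category code for Decimal Number.
categories = {unicodedata.category(ch) for v in values for ch in v}
all_ascii = all(ch.isascii() for v in values for ch in v)

# Every value has length 1, so total characters equals the row count.
total_characters = n_rows * len(values[0])
```

For binary indicator columns these character statistics add no information beyond the value counts themselves.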
category_grocery_net
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 289508 |
| 1 | 10492 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 289508 | 96.5% |
| 1 | 10492 | 3.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 289508 | 96.5% |
| 1 | 10492 | 3.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 289508 | 96.5% |
| 1 | 10492 | 3.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 289508 | 96.5% |
| 1 | 10492 | 3.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 289508 | 96.5% |
| 1 | 10492 | 3.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 289508 | 96.5% |
| 1 | 10492 | 3.5% |
category_grocery_pos
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 271478 |
| 1 | 28522 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 1 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271478 | 90.5% |
| 1 | 28522 | 9.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271478 | 90.5% |
| 1 | 28522 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271478 | 90.5% |
| 1 | 28522 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271478 | 90.5% |
| 1 | 28522 | 9.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271478 | 90.5% |
| 1 | 28522 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271478 | 90.5% |
| 1 | 28522 | 9.5% |
category_health_fitness
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 280277 |
| 1 | 19723 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 280277 | 93.4% |
| 1 | 19723 | 6.6% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 280277 | 93.4% |
| 1 | 19723 | 6.6% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 280277 | 93.4% |
| 1 | 19723 | 6.6% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 280277 | 93.4% |
| 1 | 19723 | 6.6% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 280277 | 93.4% |
| 1 | 19723 | 6.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 280277 | 93.4% |
| 1 | 19723 | 6.6% |
category_home
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 271535 |
| 1 | 28465 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 1 |
| 5th row | 1 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271535 | 90.5% |
| 1 | 28465 | 9.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271535 | 90.5% |
| 1 | 28465 | 9.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271535 | 90.5% |
| 1 | 28465 | 9.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271535 | 90.5% |
| 1 | 28465 | 9.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271535 | 90.5% |
| 1 | 28465 | 9.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 271535 | 90.5% |
| 1 | 28465 | 9.5% |
category_kids_pets
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 273671 |
| 1 | 26329 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 273671 | 91.2% |
| 1 | 26329 | 8.8% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 273671 | 91.2% |
| 1 | 26329 | 8.8% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 273671 | 91.2% |
| 1 | 26329 | 8.8% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 273671 | 91.2% |
| 1 | 26329 | 8.8% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 273671 | 91.2% |
| 1 | 26329 | 8.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 273671 | 91.2% |
| 1 | 26329 | 8.8% |
category_misc_net
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |

| Value | Count |
|---|---|
| 0 | 285256 |
| 1 | 14744 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 285256 | 95.1% |
| 1 | 14744 | 4.9% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 285256 | 95.1% |
| 1 | 14744 | 4.9% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 285256 | 95.1% |
| 1 | 14744 | 4.9% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 285256 | 95.1% |
| 1 | 14744 | 4.9% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 285256 | 95.1% |
| 1 | 14744 | 4.9% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 285256 | 95.1% |
| 1 | 14744 | 4.9% |
category_misc_pos
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0 | 281500 |
|---|---|
| 1 | 18500 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 281500 | 93.8% |
| 1 | 18500 | 6.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 281500 | 93.8% |
| 1 | 18500 | 6.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 281500 | 93.8% |
| 1 | 18500 | 6.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 281500 | 93.8% |
| 1 | 18500 | 6.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 281500 | 93.8% |
| 1 | 18500 | 6.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 281500 | 93.8% |
| 1 | 18500 | 6.2% |
category_personal_care
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0 | 278944 |
|---|---|
| 1 | 21056 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278944 | 93.0% |
| 1 | 21056 | 7.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278944 | 93.0% |
| 1 | 21056 | 7.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278944 | 93.0% |
| 1 | 21056 | 7.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278944 | 93.0% |
| 1 | 21056 | 7.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278944 | 93.0% |
| 1 | 21056 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 278944 | 93.0% |
| 1 | 21056 | 7.0% |
category_shopping_net
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0 | 277642 |
|---|---|
| 1 | 22358 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 277642 | 92.5% |
| 1 | 22358 | 7.5% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 277642 | 92.5% |
| 1 | 22358 | 7.5% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 277642 | 92.5% |
| 1 | 22358 | 7.5% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 277642 | 92.5% |
| 1 | 22358 | 7.5% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 277642 | 92.5% |
| 1 | 22358 | 7.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 277642 | 92.5% |
| 1 | 22358 | 7.5% |
category_shopping_pos
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0 | 272957 |
|---|---|
| 1 | 27043 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 1 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 272957 | 91.0% |
| 1 | 27043 | 9.0% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 272957 | 91.0% |
| 1 | 27043 | 9.0% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 272957 | 91.0% |
| 1 | 27043 | 9.0% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 272957 | 91.0% |
| 1 | 27043 | 9.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 272957 | 91.0% |
| 1 | 27043 | 9.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 272957 | 91.0% |
| 1 | 27043 | 9.0% |
category_travel
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| 0 | 290536 |
|---|---|
| 1 | 9464 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Characters and Unicode
| Total characters | 300000 |
|---|---|
| Distinct characters | 2 |
| Distinct categories | 1 |
| Distinct scripts | 1 |
| Distinct blocks | 1 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 0 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 290536 | 96.8% |
| 1 | 9464 | 3.2% |
Length
Category Frequency Plot
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 290536 | 96.8% |
| 1 | 9464 | 3.2% |
Most occurring characters
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 290536 | 96.8% |
| 1 | 9464 | 3.2% |
Most occurring categories
| Value | Count | Frequency (%) |
|---|---|---|
| Decimal Number | 300000 | 100.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 290536 | 96.8% |
| 1 | 9464 | 3.2% |
Most occurring scripts
| Value | Count | Frequency (%) |
|---|---|---|
| Common | 300000 | 100.0% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 290536 | 96.8% |
| 1 | 9464 | 3.2% |
Most occurring blocks
| Value | Count | Frequency (%) |
|---|---|---|
| ASCII | 300000 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
|---|---|---|
| 0 | 290536 | 96.8% |
| 1 | 9464 | 3.2% |
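The Frequency (%) values reported for the 0/1 indicator columns above are each count divided by the 300000 observations. The following is a minimal sketch in plain Python that redoes that arithmetic from the counts of 1s listed in this section; the names `ones`, `shares` and `frequencies` are illustrative and not part of the report.

```python
# Total number of observations, as stated in the dataset statistics.
TOTAL = 300000

# Number of rows equal to 1 for each indicator column, copied from the report.
ones = {
    "category_misc_net": 14744,
    "category_misc_pos": 18500,
    "category_personal_care": 21056,
    "category_shopping_net": 22358,
    "category_shopping_pos": 27043,
    "category_travel": 9464,
}


def shares(count_of_ones, total=TOTAL):
    """Return (percent of 0s, percent of 1s) for a binary indicator column."""
    count_of_zeros = total - count_of_ones
    return 100 * count_of_zeros / total, 100 * count_of_ones / total


frequencies = {name: shares(count) for name, count in ones.items()}

for name, (pct_zero, pct_one) in frequencies.items():
    print(f"{name}: 0 -> {pct_zero:.1f}%, 1 -> {pct_one:.1f}%")
```

Because every column has no missing cells, the two shares of each indicator always sum to 100%.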
Spearman's ρ
Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better at catching nonlinear monotonic correlations than Pearson's r. Its value lies between -1 and +1, with -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation, and +1 indicating total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
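As an illustration of that recipe, here is a minimal pure-Python sketch that ranks both variables (ties receive the average rank, which is an assumption made here, not something the report states) and then divides the covariance of the ranks by the product of their standard deviations. The function names are hypothetical, and this is not the code path the profiling library itself uses.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    start = 0
    while start < len(order):
        end = start
        # Extend the run while the next sorted value ties with the current one.
        while end + 1 < len(order) and values[order[end + 1]] == values[order[start]]:
            end += 1
        shared_rank = (start + end) / 2 + 1
        for position in range(start, end + 1):
            ranks[order[position]] = shared_rank
        start = end + 1
    return ranks


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# A monotonic but nonlinear relationship: y = x cubed.
x = [1, 2, 3, 4, 5]
y = [v ** 3 for v in x]
rho = spearman_rho(x, y)  # ranks agree exactly, so rho is 1 up to rounding
r = pearson_r(x, y)       # the raw values are not on a line, so r is below 1
```

The example shows the point made in the text: the cubic relationship is perfectly monotonic, so ρ reaches its maximum while r does not.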
Pearson's r
Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. Its value lies between -1 and +1, with -1 indicating total negative linear correlation, 0 indicating no linear correlation, and +1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear relationship the slope (the angle to the x-axis) does not affect the magnitude of r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
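The calculation described above can be sketched in a few lines of plain Python. The function name `pearson_r` and the sample data are illustrative assumptions; the sketch uses population (divide by n) covariance and standard deviations, which gives the same r as the sample (n - 1) versions because the factor cancels.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson's r: covariance of x and y over the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / n
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / n)
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / n)
    return cov / (std_x * std_y)


x = [1.0, 2.0, 3.0, 4.0, 5.0]
y = [2 * v + 1 for v in x]          # exact linear relationship with positive slope

r_linear = pearson_r(x, y)
# Shifting and positively rescaling x leaves r unchanged.
r_rescaled = pearson_r([10 * v - 3 for v in x], y)
# A negative slope flips the sign of r but not its magnitude.
r_negative = pearson_r(x, [-v for v in y])
```

Here `r_linear` and `r_rescaled` agree up to floating-point rounding, which is the location-and-scale invariance mentioned in the text.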
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1, with -1 indicating total negative correlation, 0 indicating no correlation, and +1 indicating total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
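The pair-counting definition above corresponds to the variant usually called τ-a. A minimal sketch follows, assuming no special treatment of ties (tied pairs count as neither concordant nor discordant); library implementations often use tie-corrected variants such as τ-b, so results can differ when the data contain ties. The function name `kendall_tau_a` is hypothetical.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant pairs - discordant pairs) / total number of pairs."""
    concordant = 0
    discordant = 0
    total_pairs = 0
    for (x1, y1), (x2, y2) in combinations(zip(x, y), 2):
        total_pairs += 1
        direction = (x1 - x2) * (y1 - y2)
        if direction > 0:
            concordant += 1   # both variables move in the same direction
        elif direction < 0:
            discordant += 1   # the variables move in opposite directions
    return (concordant - discordant) / total_pairs


x = [1, 2, 3, 4]
y = [1, 3, 2, 4]
# Of the 6 pairs, 5 are concordant and 1 is discordant: tau = (5 - 1) / 6.
tau = kendall_tau_a(x, y)
```

This quadratic loop is fine for a handful of values; on 300000 rows it would visit roughly 4.5e10 pairs, so a library routine is the practical choice there.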
Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been shown to be biased, even for large samples. We use a bias-corrected measure proposed by Bergsma in 2013.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. Extensive documentation is available for Phik.
First rows
| | amt | zip | lat | long | city_pop | merch_lat | merch_long | age | hour | day | month | is_fraud | category_food_dining | category_gas_transport | category_grocery_net | category_grocery_pos | category_health_fitness | category_home | category_kids_pets | category_misc_net | category_misc_pos | category_personal_care | category_shopping_net | category_shopping_pos | category_travel |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 167.42 | 93529 | 37.7773 | -119.0825 | 633 | 38.492626 | -118.677235 | 95 | 8 | 2 | 12 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 1 | 46.91 | 39769 | 33.3570 | -89.0473 | 1923 | 33.193352 | -90.017058 | 62 | 11 | 5 | 4 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 2 | 8.46 | 10504 | 41.1360 | -73.7009 | 7987 | 41.493080 | -74.290518 | 58 | 10 | 4 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 3 | 112.71 | 63665 | 37.3272 | -91.0243 | 241 | 36.342555 | -91.407343 | 48 | 18 | 4 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 4 | 56.41 | 64686 | 39.7417 | -93.6289 | 271 | 39.435902 | -93.064931 | 50 | 22 | 6 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 5 | 43.07 | 77412 | 29.6047 | -96.5249 | 106 | 29.645347 | -96.489121 | 39 | 21 | 6 | 10 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
| 6 | 16.31 | 62673 | 40.0994 | -89.9601 | 530 | 40.703103 | -90.359857 | 55 | 14 | 4 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 |
| 7 | 229.84 | 62075 | 39.3036 | -89.2853 | 3458 | 39.297215 | -88.859334 | 37 | 13 | 3 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| 8 | 124.20 | 57756 | 43.3526 | -102.5411 | 1126 | 42.944973 | -101.577966 | 42 | 11 | 1 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 |
| 9 | 169.73 | 25106 | 38.8265 | -82.1364 | 642 | 39.612086 | -82.365229 | 76 | 10 | 6 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
Last rows
| | amt | zip | lat | long | city_pop | merch_lat | merch_long | age | hour | day | month | is_fraud | category_food_dining | category_gas_transport | category_grocery_net | category_grocery_pos | category_health_fitness | category_home | category_kids_pets | category_misc_net | category_misc_pos | category_personal_care | category_shopping_net | category_shopping_pos | category_travel |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 299990 | 3.30 | 48034 | 42.4969 | -83.2911 | 75830 | 43.099145 | -83.934940 | 48 | 12 | 6 | 6 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 299991 | 69.76 | 97813 | 45.8289 | -118.4971 | 1302 | 46.392243 | -118.525674 | 46 | 13 | 1 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 299992 | 56.67 | 8014 | 39.8016 | -75.3478 | 504 | 40.177050 | -75.249829 | 42 | 6 | 4 | 11 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 299993 | 11.66 | 38922 | 33.9215 | -89.6782 | 3451 | 34.154141 | -89.238409 | 38 | 23 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 299994 | 45.87 | 87543 | 36.1486 | -105.6648 | 247 | 35.918408 | -105.011244 | 61 | 5 | 0 | 12 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 299995 | 35.94 | 8350 | 39.4850 | -74.8776 | 825 | 38.762023 | -74.957725 | 31 | 11 | 6 | 9 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| 299996 | 88.58 | 67216 | 37.6223 | -97.3136 | 409656 | 38.444081 | -97.488754 | 92 | 4 | 6 | 6 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |
| 299997 | 16.63 | 76834 | 31.8287 | -99.4270 | 5908 | 31.973577 | -98.934545 | 60 | 1 | 6 | 3 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 |
| 299998 | 40.91 | 74633 | 36.6966 | -96.7869 | 471 | 36.753618 | -95.914743 | 81 | 4 | 5 | 11 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| 299999 | 78.06 | 75092 | 33.6372 | -96.6184 | 46563 | 33.756980 | -96.078755 | 52 | 4 | 6 | 3 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 0 |